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Abstract 

Background: Estrogens control multiple functions of hormone-responsive breast cancer cells. They regulate diverse 
physiological processes in various tissues through genomic and non-genomic mechanisms that result in activation 
or repression of gene expression. Transcription regulation upon estrogen stimulation is a critical biological process 
underlying the onset and progress of the majority of breast cancer. ERa requires distinct co-regulator or 
modulators for efficient transcriptional regulation, and they form a regulatory network. Knowing this regulatory 
network will enable systematic study of the effect of ERa on breast cancer. 

Methods: To investigate the regulatory network of ERa and discover novel modulators of ERa functions, we 
proposed an analytical method based on a linear regression model to identify translational modulators and their 
network relationships. In the network analysis, a group of specific modulator and target genes were selected 
according to the functionality of modulator and the ERa binding. Network formed from targets genes with ERa 
binding was called ERa genomic regulatory network; while network formed from targets genes without ERa 
binding was called ERa non-genomic regulatory network. Considering the active or repressive function of ERa, 
active or repressive function of a modulator, and agonist or antagonist effect of a modulator on ERa, the 
ERa/modulator/target relationships were categorized into 27 classes. 

Results: Using the gene expression data and ERa Chip-seq data from the MCF-7 cell line, the ERa genomic/non- 
genomic regulatory networks were built by merging ERa/ modulator/target triplets (TF, M, F), where FF refers to the 
ERa, M refers to the modulator, and T refers to the target. Comparing these two networks, ERa non-genomic network 
has lower FDR than the genomic network. In order to validate these two networks, the same network analysis was 
performed in the gene expression data from the ZR-75.1 cell. The network overlap analysis between two cancer cells 
showed 1% overlap for the ERa genomic regulatory network, but 4% overlap for the non-genomic regulatory network. 

Conclusions: We proposed a novel approach to infer the ERa/modulator/target relationships, and construct the 
genomic/non-genomic regulatory networks in two cancer cells. We found that the non-genomic regulatory 
network is more reliable than the genomic regulatory network. 



Background nuclear receptors identified [1-3]. Estrogen receptor (ER) 

Nuclear receptors (NR) are a superfamily of ligand- is a member of the nuclear receptor superfamily and is 

activated transcription factors that modulate specific gene categorized into the class of ligand-dependent steroid 

expression by interacting with specific DNA sequence receptor in the 1960s. The study explained it controls 

upstream of their target gene. So far there are over 100 diverse biological processes by mediating the actions of 

steroid hormone estrogen and afforded an appreciation of 
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cells. Finally, the general pathway for steroid hormone 
action was subsequently elucidated [4]. 

Unlike conventional transcription factors, ER is com- 
posed of several domains including ligand binding, DNA 
binding, dimerization, and transcriptional activation. The 
ligand binding domain participates in several activities 
including hormone binding, homo- and/or heterodimeri- 
zation, and transcriptional activation and repression. The 
binding of the estrogen induces conformational changes 
in ER that could regulate gene expression by directed 
interaction with DNA (genomic pathway of ER action) or 
via an undirected connection with the modulation of 
some specific proteins (non-genomic pathway) [5,6]. 

In a gene regulatory network, gene transcription varia- 
tions are controlled by many transcription factors. It has 
been established that the presence of regulatory sequences 
is in the proximity of genes and the existence of proteins 
is able to bind to those elements and to control the activity 
of genes by either activation or repression of transcription 
[7]. To understand gene regulation, the inference of its 
regulatory network is an important research topic [8]. 
Recent genomic technology, such as genome wide expres- 
sion array or sequencing, allows us to elucidate the global 
gene regulatory mechanisms. Due to the well-developed 
microarray technology, the wealthy information for gene 
expression allows us to observe the expression levels of 
thousand of gene at once and helps more accurately pre- 
dict gene-to-gene interaction according to its similarity or 
dissimilarity. 

One approach to establish the gene regulatory network 
is to start from gene-gene correlations or interactions. 
Many computational approaches have been developed 
aimed to measure associations between mRNA abundant 
profiles to predict the transcriptional regulatory interac- 
tion. Some attempts at determining gene regulation 
based on the gene expression clustering algorithm. They 
group the genes that show similar gene expression using 
correlation coefficient matrix [9] or mutual information- 
based algorithm [10,11] under the same condition [8,9]. 
However, clustering the resembling genes that are co- 
regulated cannot present much more information about 
the biological mechanisms of gene regulation or regula- 
tory pathway. Thus, some computational algorithms are 
proposed to reconstruct the gene networks by applying 
statistical approaches, such as Relevance Network, Baye- 
sian Network, Linear Regression Network [12], and our 
own Regulation Network [3]. 

Relevance Network detects the relatedness between 
two genes from their gene expression profiles and gives a 
link between transcription factor and its target gene if 
correlated [13-17]. The typical methods to calculate the 
relatedness are Pearson Correlation Coefficient and 
Mutual information. Pearson Correlation Coefficient pro- 
vides better performance on detecting linear relationships 



but it is not as intuitive as the Euclidean distance mea- 
sure [17]. Mutual information (MI) gives good perfor- 
mance on non-linear relationship. For example, 
ARACNE algorithm [16] estimates the mutual informa- 
tion between the gene expressions of two genes using 
Gaussian kernel estimator. The measure of relatedness 
by MI ranges from 0 to 1. Relevance network is a rela- 
tively simple model, which computes the pair-wise simi- 
larity or dissimilarity between two genes. 

Bayesian Network (BN) can identify casual relation- 
ships between variables. The topology a BN can provide 
the dependence or independence of variable [18], BN 
algorithm can reveal the dynamics of the gene regulation 
hierarchy. While BN has its advantage of structure 
model, it is difficult to inform whether a node (gene) is 
important to be included. Another challenging is its com- 
putational stability. It usually results in multiple optimal 
networks [19]. The high computational requirement 
leads to almost impossible of inference to a large-scale 
regulatory network [20]. Also, BN assumes no gene-gene 
interaction, which can misrepresent the data. 

Our proposed ERa regulatory network is a combina- 
tion of TF binding affinity estimated from ChlP-seq 
data, up or down regulation using gene expression, and 
motif conservation in probe sequences. This approach 
effectively utilized the genomic or non-genomic actions. 
Unlike previous regression approaches, this method did 
not use correlation information. 

In this paper, our proposed approach analyzes the inter- 
action between TF and target gene conditioned on a 
group of specific modulator genes. Also, we consider the 
change of modulators' expression level to perceive its 
influence on transcriptional activity. We reconstruct gene 
regulatory networks in related biological subjects via a 
multiple linear regression approach with interaction term 
such that the inferred modulator gene is directiy embodied 
and the relationships of the biological subjects they repre- 
sent are easily exploited. As a result, this reveals deeper 
insight on how the structure, function, and behaviour of 
components evolve. 

Method 

mRNA gene expression profiling data preprocessing 

mRNA expression profiling for ERa was performed as 
previously described [2] . The microarray data for ERa in 
MCF-7 and ZR-75.1 cells treated with E2 were obtained 
from Experiment E-TABM-742 (http://www.ebi.ac.uk/ 
arrayexpress/experiments/E-TABM-742). In this dataset, 
gene expression profile of human estrogen-responsiveb- 
breast cancer cell lines ZR-75.1 and MCF-7 treated with 
10~ 8 M of 17 p -estradiol (E2) on total RNA extracted 
before or after 1, 2, 4, 6, 8, 12, 16, 20, 24, 28 and 32 hours 
hormonal stimulation. The microarray data were prepro- 
cessed by BeadStudio Software version 3.2 with quantile 
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normalization. Data were log2 transformed and the ratio 
of each signal against average reference signal were calcu- 
lated [2]. There are 48702 probes in total. 

More quality control analyses were performed. A 
probe is considered as absent if it is called absent in 
every time point by the BeadStudio Software, and these 
absent probes were excluded. Then, gene expression 
was averaged from the existing duplicate present probes. 
Genes with the small coefficient of variance are also 
removed. Using the threshold of 0.15, there were 6418 
genes in MCF-7 cell and 13872 genes for ZR-75.1 cell 
left for the network analysis. 

ChlP-seq data analysis 

ERa ChlP-seq data were prepared by Tim H. M. Huang's 
lab and generated for MCF-7 cell before and after E2 
treatment 0.5, 1, and 24 hours. Sequencing was con- 
ducted with Illumina Genome Analyzer II (GA II) as per 
manufacturer's instructions. Reads are organized into a 
contiguous assembly of 24 different strands (one for each 
chromosome) and mapped to human genome reference 
sequence (HG18) using ELAND provided by Illumina. 
Four published peak-calling algorithms were applied to 
call the ERa binding sites: including CisGenome [21], 
GeneTrack [22], MACS [23], and SISSRS [24] with FDR 
= 0.001 to predict ERa binding peaks. 

Linear regression model with interaction term 

A linear regression model (1), with an interaction term 
is constructed to describe the relationship among TF 
(ERa), M (modulator), and T (target). 



T = ai + a2-M + biTF + \>2TF x M + e, 



(1) 



where (TF, T) are all Jxl vectors gene expression, and J 
is the total sampling time of gene expression samples; M 
is a binary variable (0 means low, and 1 means high) 
derived by the expression of a modulator's expression 
divided by its median; {a lt a 2 , b lt b 2 ) are regression coef- 
ficients. For parameter setting, we did try the continuous 
scale for M at the beginning, and found that it often 
times it was highly sensitive to its skewed distribution 
because of small sample size. Therefore, a binary M is 
more robust choice. When the expression of modulator 
is high (M = 1), Eq. (1) becomes T = (ai + a 2 ) + (bi + b 2 ) 
TF; otherwise, T = ai +bi7T when M = 0. Parameter & 2 
presents the change of T in response to TF influenced 
by M. If bi is not statistically significant, the data do not 
have enough evidence to support M modulates the TF's 
effect on the target T. 

Schematic representation 

Using a linear regression model with the interaction 
term, the regression coefficients (a 2 , bi, b 2 ) define the 



functional relationship of the triplicate (T, TF, M). 
Eq. (2) defines the correlation indicator for all three 
regression parameters for any triplicate data analysis. 
The significant means p-value is less than 0.05. 



Correlation lndicator(Cl) ■■ 



+, significant with positive value 

—, significant with negative value (2) 

0, not significant 



A schematic overview of the triplicate network is 
shown in Figure 1. There are two lines between TF and 
T. These two lines mean two types of network construc- 
tion according to the rule of modulator in the network. 
The solid line stands for the direct influence on the 
relationship between TF and T, which is independent of 
M; while the dashed line represents that the relationship 
of TF on T depends on M. Based on the criterion of 
correlation indicator (CI) described above, there are 27 
categories of network behaviours from all combinations. 

Modulator gene candidates 

We focus our modulators on specific molecular func- 
tional classes. Using Gene Ontology (GO) molecular 
function, several functional classes are included: protein 
kinase activity, phosphoprotein phosphatase activity, 
acetyltransferase activity, deacetylase activity, methl- 
transferase activity, transcription factors, and transcrip- 
tional cofactors. In these 7 specific GO molecular 
functions, 485 unique modulator genes are found from 
the pool of 6418 presented genes in MCF-7 cell. The 
result of GO analysis is shown in Table 1. 

ERa genomic and non-genomic target gene selection 

Using ChlP-seq analysis, only genes that have ERa bind- 
ings at 0, 0.5, 1, and 24 hours are considered genomic 
target. Four peak finding algorithms are used in predict- 
ing the binding sites: CisGenome, GeneTrack, MACS, 
and SISSRS with FDR = 0.001. Motif Conservation Score 
(MCS) [25] of each binding site overlapping with the 



b,= 0, b 2 = + - * Activator Agonist 
b,= 0, b 2 = - *-» Repressor Agonist 




Figure 1 Linear regression model. A schematic overview of the 
triplicate network: Based on the criterion of correlation indicator (CI) 
described above, there are 27 categories (TF/M/T relationships) of 
network behaviours from all combinations. 
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Table 1 Gene ontology table for modulator genes 



G0:ID 



# of unique 
modulator 



Signalling protein 






Protein kinase activity 


GO:0004672 


162 


Phosphoprotein phosphatase 


GQ0004721 


43 


activity 






Acetyltransferase activity 


GQO0 16407 


8 


Deacetylase activity 


GQ0019213 


1 


Methltransferase activity 


GO:0008168 


57 


TF protein 






Transcription factor activity 


GQ0030528 


156 


Transcription cofactor activity 


GO:0003712 


58 


Total 




485 



known genes from the 6418 gene profiles is calculated, 
and a 95% conservation score is chosen as the threshold. 
As a consequence, 56 genes appearing in the results of 4 
diverse algorithms can regard as the reliable target gene 
candidates in ERa genomic network. For the non-geno- 
mic network, we didn't treat all the other genes as non- 
genomic targets. As a matter of fact, only genes who 
don't have any binding signals with all the ChlP-seq peak 
finding tools in all four time points were selected as the 
non-genomic targets. There are 5877 genes not appearing 
in the results of 4 diverse algorithms. 

Network construction 

Using the triplicates generated from the linear model 
with the interaction, a network model can be con- 
structed. This initial network will include an enormous 
amount of modulators which have only limited targets. 
Therefore, a filtering threshold is developed to keep 
only modulators with significant number of targets. In 
this network analysis, all the connections among the 
modulators, targets, and the TF are assumed at random, 
though the total number of connections is fixed as the 
observed number from our previous analysis. After 1000 
times shuffling in the network connections, a distribu- 
tion of target number of any modulator is formed, and 
95% threshold is chosen for the modulator selection. 

ERa regulatory network construction flowchart 

The ERa regulatory network construction goes through 
data pre-processing, modulator selection, genomic target/ 
non-target selection, linear regression with interaction, 
and network construction. Figure 2 shows an integration 
of analytic workflow for the proposed method. 

Results 

ERa/modulator categories 

Using ChlP-seq data, we predicted 56 ERa genomic tar- 
gets and 5877 non-genomic targets. Together with our 



Data Preprocessing 

■«70ieretwilfomMa7 wd It n 1 Gf nc (.prcinonOata 
•Rcrncm 'UvmMHT prMmtrr P 

• [ Kludc un.nloimjIiM probn *Hlh |ST0/Mun) < 0 IS 




Target Gene Selection 



Gene Regulatory Network Analysis by 
Linear Regression Model 



Figure 2 The analytic workflow of gene regulatory network 
analysis. An integration of analytic workflow for the proposed 
method is proposed. GRN construction goes through data pre- 
processing, modulator selection, genomic target/non-target 
selection, linear regression with interaction, and network 
construction. 



pre-specified modulators, we generated 22 (T, TF, M) 
categories that involve the interactions with modulator 
genes. Many categories have FDR < 10% in both geno- 
mic and non-genomic (T, TF, M) categories. In particu- 
larly, genomic regulatory categories have higher FDRs 
than the non-genomic categories. 

The following are four highlighted examples. Figure 3 
(a,b) show the modulator genes mediated the transcrip- 
tion activities, which enhance and repress the expression 
level of their corresponding target genes. Figure 3(c,d) 
illustrates that CDK6 possesses the indirect influence on 
target genes according to the diverse level of its gene 
expression. Figure 3(c) represents CDK6 is inferred as 
an activator agonist of TF (ERa). Conventionally, ERa has 
no function on target gene (LOC652683). When the 
expression level of CDK6 comes to high, it stimulates ERa 
to turn into an activator to target gene (LOC652683). On 
the contrary, CDK6 owns the opposite capability to ERa 
since TF which makes no impact on target gene (CHD9) 
when CDK6 expression is high, which are illustrated in 
the Figure 3(d). It is implicated that each modulator gene 
can stimulate TF to either activate or repress a large num- 
ber of targets, depending on the cellular context. 

ERa regulatory network analysis 

Once a biological process is represented by a network, the 
analysis of network topology uncovers the functional orga- 
nization and unknown organizing principles of cellular 
systems [26]. Many network researches investigate net- 
work activity for an active node by using the concept of 
degrees, defined as the number of edges adjacent to the 
neighbours. As a node has larger or higher connectivities, 
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(a) Class 7 : TF Activator and M Agonist 
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(b) Class 8: TF Activator and M Antagonist 
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(d) Class 22: M repressor Agonist 



Figure 3 Examples of the behavior of gene interaction. 4 highlight examples of inferred gene interactions, (a) and (b) shows the modulator 
genes mediated the transcription activities, which enhance and repress the expression level of their corresponding target genes, (c) and (d) show 
CDK6 possesses the indirect influence on target genes according to the diverse level of its gene expression, which stimulate the TF activity. 



it represents to be the important connector and has more 
impact on the signalling pathway. For this reason, we 
zoom in our scope into a small-scale ERa regulatory 
networks. 



To concentrate on those modulators with a large 
number of targets, ERa regulatory networks are con- 
structed from the triplicates from Table 2. The thresh- 
olds on both MCF-7 and ZR-75.1cell were chosen based 



Table 2 Statistical results for modulator-TF-target Interaction 















ERa genomic 




ERa non-genomic 






Classification 


a 2 


b, 


b 2 


# of unique # of 


FDR 


# of unique 


# of 


FDR 












M/T 


connection 


(%) 


M/T 


connection 


(%) 


1 


TF (Ind. Activator) 


0 


+ 


0 














2 


TF (Ind. Repressor) 


0 




0 














3 


TF (Ind. Activator) and M (Ind. Activator) 


+ 


+ 


0 


58/21 


67 


34.870 


367/2590 


6447 


1.906 


4 


TF (Ind. Repressor) and M (Ind. Activator) 


+ 




0 


0/0 


0 


NA 


0/0 


0 


NA 


5 


TF (Ind. Activator) and M (Ind. Repressor) 




+ 


0 


36/14 


40 


16.688 


306/1 035 


3230 


~ 0 


6 


TF (Ind. Repressor) and M (Ind. Repressor) 






0 


0/0 


0 


NA 


144/88 


349 


~ 0 


7 


TF (Activator) and M (Agonist) 


0 


+ 


+ 


185/21 


307 


14.133 


419/2372 


71465 


0.587 


8 


TF (Activator) and M (Antagonist) 


0 


+ 




282/47 


908 


3.676 


470/4875 


71166 


0.824 
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Table 2 Statistical results for modulator-TF-target Interaction (Continued) 



Q 

y 


TF (Repressor) and M (Antagonist) 


U 




+ 


y4/z 


yo 


U 


ZDO/ ZUD 


C 1 

D I ZD 


.zoy 


10 


TF (Repressor) and M (Agonist) 


0 


- 




0/0 


0 


NA 


0/0 


0 


NA 


11 


TF (Activator) and M (Agonist & Ind. 
Activator) 


+ 


+ 


+ 


0/0 


0 


NA 


0/0 


0 


NA 


12 


TF (Activator) and M (Antagonist & Ind. 
Activator) 


+ 


+ 




284/51 


849 


1.572 


477/5204 


78213 


0.206 


13 


TF (Repressor) and M (Antagonist & Ind. 
Activator) 


+ 




+ 


0/0 


0 


NA 


0/0 


0 


NA 


14 


TF (Repressor) and M (Agonist & Ind. 
Activator) 


+ 




- 


0/0 


0 


NA 


0/0 


0 


NA 


1 ^ 
1 J 


TF (Activator) and M (Agonist & Ind. 
Repressor) 




+ 


+ 


171 111 
LI LI LL 


0.3/1 


1 (Y71 


A 1.1 11 1 71 
4 dLI L\ 1 I 


^ 1 n^/i 

D 1 U04 




16 


TF (Activator) and M (Antagonist & Ind. 
Repressor) 




+ 




0/0 


0 


NA 


0/0 


0 


NA 


17 


TF (Repressor) and M (Antagonist & Ind. 
Repressor) 


- 


- 


+ 


23/2 


26 


12.837 


332/442 


8152 


~ 0 


18 


TF (Repressor) and M (Agonist & Ind. 
Repressor) 


- 


- 


- 


0/0 


0 


NA 


0/0 


0 


NA 


19 


M (Ind. Activator) 


+ 


o 


o 














20 


M (Ind. Repressor) 




0 


0 














21 


M (Activator Agonist) 


0 


0 


+ 


261/37 


835 


4.797 


418/3619 


118241 


0.479 


22 


M (Repressor Agonist) 


0 


0 




42/8 


65 


1 5.404 


213/977 


5546 


1.709 


23 


M (Activator Agonist & Ind. Activator) 


+ 


0 


+ 


0/0 


0 


NA 


0/0 


0 


NA 


24 


M (Repressor Agonist & Ind. Activator) 


+ 


0 




23/9 


34 


19.633 


354/1453 


6119 


0.951 


25 


M (Activator Agonist & Ind. Repressor) 




0 


+ 


277/40 


638 


2.093 


440/3852 


80644 


0.336 


26 


M (Repressor Agonist & Ind. Repressor) 




0 




0/0 


0 


NA 


0/0 


0 


NA 


27 


No Function 


0 


0 


0 















on the data itself, not on heuristics. A threshold of 18 
targets for MCF-7 cell was chosen for modulator selec- 
tion with a 0.1 level of FDR. Figure 4 and 5 visualize the 
ERa genomic and non-genomic network. ERa genomic 
regulatory network comprises of 85 modulators and 56 
target genes. On the other hand, there are 25 modula- 
tors and 87 target genes in ERa non-genomic network. 
The gene marked with green rectangle represents modu- 
lator gene and the gene with pink circle is target gene. 
The size of rectangle or circle differentiates the degree 
of connections. The red colour line stands for the inter- 
action between modulator and transcription factor, and 
blue line means the transcription activity between a pair 
of genes. 

Validation studies with ZR-75.1 

In the previous studies [27,28], both ERa-positive, hor- 
mone-responsive MCF-7 and ZR-75.1 cell had been well 
established to investigate the molecular and genomic 
mechanisms mediating hormonal control of cell function 
in vitro. Evidences in literature showed that some known 
genes in MCF-7 and ZR-75.1 cells are identified to have 
identical kinetic and type of response to hormone in 
breast cancer model after E2 stimulation. To investigate 
the target genes whose expression profile were signifi- 
cantly modified and the modulator genes who mediate 



the transcription activities of ERa action, gene expression 
data of ZR-75.1 was applied to our regulatory model to 
validate the results of MCF-7. It yielded (47/4799) over- 
lapped triplets for genomic network and (12/270) triplets 
for non-genomic network, where 4799 and 270 are the 
number of triplets from the results of MCF-7. Using 
these overlapping triplets, ERa genomic/nongenomic 
network are exhibited in Figure 6. 

For the results of the overlapping networks, the statis- 
tical significance of comparing the ERa network between 
two cancer cells is not our primary interests. As we have 
shown in our previous work, the overlap between the 
two gene regulatory networks in two different time 
points after E2 stimulation was very small even in a sin- 
gle cell line. We don't expect a significant overlap 
between two networks between two cells. The simply 
want to know what are the overlapped components and 
non-overlapped components. This description itself is 
very valuable. In addition, both GO and KEGG don't 
have estrogen signalling pathways, and we don't feel 
these analyses will add much to our understanding of 
the estrogen regulatory network. 

Conclusions 

This paper proposed a regression model based approach 
in ERa regulatory network model construction. It 
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characterizes the interaction between ERa and its mod- are able to construct ERa genomic and non-genomic 
ulators and their associated gene targets. With addi- regulatory models. Comparing these two networks, ERa 
tional ERa binding information from ChlP-seq data, we non-genomic network has lower FDR than the genomic 
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(a) Genomic network 




(77 



(b) Non-Genomic network 

Figure 6 Overlapping network between MCF-7 and ZR-75.1 cells, (a) shows the overlapping genomic network between two cell lines. It 
consists 26 modulator and 71 target genes, (b) is the overlapping non-genomic network. 
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network. This was validated by the same network analy- 
sis on both ZR-75.1 and MCF-7 cells. 



Acknowledgements 

Based on "An ERa/modulator regulatory network in the breast cancer cells", 
by Heng-Yi Wu, Pengyue Zheng, Guanglong Jiang, Yunlong Liu, Kenneth P 
Nephew, Tim HM Huang and Lang Li which appeared in Genomic Signal 
Processing and Statistics (GENSIPS), 2011 IEEE International Workshop on. <S 
2011 IEEE [29]. 

This work is supported by the U.S. National Institutes of Health grants R01 
GM74217 (L. L), AA017941 (Y.L.), CA1 13001 (T.H-M.H. and K.P.N), and U-01 
GM61373 (D.F.). 

This article has been published as part of BMC Genomics Volume 13 
Supplement 6, 2012: Selected articles from the IEEE International Workshop 
on Genomic Signal Processing and Statistics (GENSIPS) 2011. The full 
contents of the supplement are available online at http7/www. 
biomedcentral.com/bmcgenomics/supplements/13/S6. 

Author details 

'Center for Computational Biology and Bioinformatics, Indiana University, 
Indianapolis, IN, USA. Indiana University School of Medicine, Indianapolis, IN, 
USA. 3 Medical Sciences, Indiana University School of Medicine, Bloomington, 
IN, USA. department of Molecular Virology, Immunology, and Medical 
Genetics, The Ohio State University, Columbus, OH, USA. 

Authors' contributions 

HYW designed the method and drafted the manuscript along with LL. PZ 
and GJ dealt with data processing. LL also critically revised the manuscript 
for important intellectual content. YL, KPN, THMH and LL supervised the 
work and gave final approval of the version of the manuscript to be 
submitted. 

Competing interests 

The authors declare that they have no competing interests. 
Published: 26 October 2012 

References 

1. McKenna NJ, Lanz RB, O'Malley BW: Nuclear receptor coregulators: cellular 
and molecular biology. Endocr Rev 1999, 20(3):321-44. 

2. Cicatiello L, Mutarelli M, Grober OM, et ah Estrogen receptor alpha 
controls a gene network in luminal-like breast cancer cells comprising 
multiple transcription factors and microRNAs. Am J Pathol 2010, 
176(5)2113-30. 

3. Shen C, Huang Y, Liu Y, et al: A modulated empirical Bayes model for 
identifying topological and temporal estrogen receptor alpha regulatory 
networks in breast cancer. BMC Syst Biol 201 1, 5(1):67. 

4. Rollerova E, Urbancikova M: Intracellular estrogen receptors, their 
characterization and function (Review). Endocr Regul 2000, 34(4)203-18. 

5. Acconcia F, Kumar R: Signaling regulation of genomic and nongenomic 
functions of estrogen receptors. Cancer Lett 2006, 238(1 ): 1 -1 4. 

6. Bjornstrom L, Sjoberg M: Mechanisms of estrogen receptor signaling: 
convergence of genomic and nongenomic actions on target genes. Mol 
Endocrinol 2005, 19(4):833-42. 

7. Schlitt T, Brazma A: Current approaches to gene regulatory network 
modelling. BMC Bioinformatics 2007, 8(Suppl 6):S9. 

8. Liang KC, Wang X: Gene regulatory network reconstruction using 
conditional mutual information. EURASIP J Bioinform Syst Biol 2008, 253894. 

9. Eisen MB, Spellman PT, Brown PO, et al: Cluster analysis and display of 
genome-wide expression patterns. Proc Natl Acad Sci USA 1998, 
95(25):14863-8. 

10. Z X, W X, D RE: Construction of genomic networks using mutual- 
information clustering and reversible-jump Markov-chain-Monte-Carlo 
predictor design. Signal Processing 2003, 83(4):15. 

1 1. Zhou X, Wang X, Dougherty ER, et al: Gene clustering based on 
clusterwide mutual information. J Comput Biol 2004, 1 1 (1):147-61 . 

12. Fu Y, Jarboe LR, Dickerson JA: Reconstructing genome-wide regulatory 
network of E. coli using transcriptome data and predicted transcription 
factor activities. BMC Bioinformatics 201 1, 12:233. 



13. Butte AJ, Kohane IS: Mutual information relevance networks: functional 
genomic clustering using pairwise entropy measurements. Pac Symp 
Biocomput 2000, 418-29. 

14. D'Haeseleer P, Wen X, Fuhrman S, et al: Mining the gene expression 
matrix: inferring gene relationships from large scale gene expression 
data. In Second International Workshop on Information Processing in Cells 
and Tissues 1998, 203-212. 

15. Kaderali L, Radde N: Inferring Gene Regulatory Networks from Expression 
Data.33-74. 

16. Margolin AA, Nemenman I, Basso K, et al: ARACNE: an algorithm for the 
reconstruction of gene regulatory networks in a mammalian cellular 
context. BMC Bioinformatics 2006, 7(Suppl 1):S7. 

17. Priness I, (Vlaimon O, Ben-Gal I: Evaluation of gene-expression clustering 
via mutual information distance measure. BMC Bioinformatics 2007, 8:111. 

18. Zou M, Conzen SD: A new dynamic Bayesian network (DBN) approach 
for identifying gene regulatory networks from time course microarray 
data. Bioinformatics 2005, 21(1):7l-9. 

19. Uusitalo L: Advantages and challenges of bayesian networks in 
environmental modelling. Ecological Modelling 2007, 203(3):7. 

20. Parikh AP, Wu W, Curtis RE, et al: TREEGL: reverse engineering tree- 
evolving gene networks underlying developing biological lineages. 
Bioinformatics 201 1, 27(13)1196-204. 

21. Ji H, Jiang H, Ma W, et al: Using CisGenome to analyze ChlP-chip and 
ChlP-seq data. Curr Protoc Bioinformatics 201 1, Chapter 2:Unit2 13. 

22. Albert I, Wachi S, Jiang C, et al: GeneTrack-a genomic data processing 
and visualization framework. Bioinformatics 2008, 24(1 0):1 305-6. 

23. Zhang Y, Liu T, Meyer CA, et al: Model-based analysis of ChlP-Seq (MACS). 
Genome Biol 2008, 9(9):R137. 

24. Narlikar L, Jothi R: ChlP-Seq data analysis: identification of protein-DNA 
binding sites with SISSRs peak-finder. Methods Mol Biol 2012, 802:305-22. 

25. (VlcLeay RC, Leat CJ, Bailey TL: Tissue-specific prediction of directly 
regulated genes. Bioinformatics 201 1, 27(17)2354-60 

26. Junker BH, Schreiber F: Analysis of Biological Networks. John Wiley & Sons; 
2008. 

27. Lacroix M, Leclercq G: Relevance of breast cancer cell lines as models for 
breast tumours: an update. Breast Cancer Res Treat 2004, 83(3):249-89. 

28. Zhu Y, Wang A, Liu MC, et al: Estrogen receptor alpha positive breast 
tumors and breast cancer cell lines share similarities in their 
transcriptome data structures. Int J Oncol 2006, 29(6):1 581 -9. 

29. Wu H, Zheng P, Jiang G, Liu Y, Nephew K, Huang THM, Li L: An ERa/ 
modulator regulatory network in the breast cancer cells. Genomic Signal 
Processing and Statistics (GENSIPS), 2011 IEEE International Workshop on: 4-6 
December 2011 2011,5-8. 



doi:1 0.1 1 86/1 471 -21 64-1 3-S6-S6 

Cite this article as: Wu et al: A modulator based regulatory network for 
ERa signaling pathway. BMC Genomics 2012 13(Suppl 6):S6. 



Submit your next manuscript to BioMed Central 
and take full advantage of: 

• Convenient online submission 

• Thorough peer review 

• No space constraints or color figure charges 

• Immediate publication on acceptance 

• Inclusion in PubMed, CAS, Scopus and Google Scholar 

• Research which is freely available for redistribution 



Submit your manuscript at 
www.biomedcentral.com/submit 



BioMed Central 



